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Abstract. Increasingly, inheritance hierarchies are being used to reduce 
redundancy in natural language processing lexicons. Systems that utilize 
inheritance hierarchies need to be able to insert words under the optimal 
set of classes in these hierarchies. In this paper, we formalize this problem 
for feature-based default inheritance hierarchies. Since the problem turns 
out to be NP-complete, we present an approximation algorithm for it. We 
show that this algorithm is efficient and that it performs well with respect 
to a number of standard problems for default inheritance. A prototype 
implementation has been tested on lexical hierarchies and it has produced 
encouraging results. The work presented here is also relevant to other 
types of default hierarchies. 

Abstract. In zunehmendem Masse werden Erbschaftshierarchien zur 
kompakten Beschreibung von Worteigenschaften in Sprachverarbeitungslex- 
ica verwendet. Systeme, die Erbschaftshierarchien bemitzen, miissen Worte 
in die optimale Klasse (oder Menge von Klassen) der Hierarchien einfiigen 
konnen. Dieser Beitrag formalisiert das Problem fur merkmalsbasierte 
rfe/aiife-Hierarchien. Da das Problem NP-vollstandig ist, wird hier ein 
Approximationsalgorithmus vorgeschlagen. Es wird gezeigt, dafi der Al- 
gorithmus effizient ist und dafi er in Hinsicht auf einige Standardprobleme 
der rfe/aiiW-Hierarchien gut funktioniert. Eine Prototypimplementierung 
zeigt gute Resultate an einigen Hierarchien, die fur Lexica der Comput- 
erlinguistik geschrieben wurden. Diese Forschung kann auch fur andere 
Arten von rfe/ait^-Hierarchien angewendet werden. 

1 Introduction 

Recent computational linguistics research of natural language lexicons has gone 
beyond simply listing idiosyncratic information about words. Many researchers 
are adding structure to their lexicons to replace the old practice of simply listing, 
in an entry for a word, all its properties. This structure often takes the form of 
feature-based default inheritance hierarchies (e.g. @, ||). After such hierarchies 
have been designed, the task of placing word entries under the appropriate classes 

* This work was funded by NSF grant IRI-9013160, ONR/DARPA research grant no. 
N00014-92-J-1512, and a grant from the Land Baden- Wiirttemberg, Germany. 



begins. The problem we address in this paper is deciding where to place a new 
object in an inheritance hierarchy — we will call this 'the insertion problem.' This 
problem has received little attention in the literature; to our knowledge this paper 
is the first to formalize the problem and to provide a general algorithm for it. 
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Fig. 1. Verb feature hierarchy (adapted from W) 

An example of the insertion problem is the task of placing a verb into the 
hierarchy presented in Fig.l. The upper case symbols (e.g., CONTROL) are class 
names and the pairscl written directly below these names represent features that 
can be inherited from them. For example, all the features of INCOMPLETE are 
inherited by EQUI — unless overridden by features in CONTROL or in EQUI. 
Consider the form of give exemplified in she gave him the ball and assume that 
it has the following features. 

1. [subj /cat ,N] [subj/case,nom] [subj /complete, +] [dobj/cat,N] 

[dobj/case,acc] [dobj/complete,+] [iobj/cat,N] [iobj/case,acc] [iobj/complete,+] 

A solution to this instance of the insertion problem would be to place give 
under the TRANSITIVE and 3-1 classes. By doing so, all of the features listed 
above could be inherited except [iobj /cat, N]. This feature would have to be 
listed directly in the entry for give so that the incorrect default inheritance 
of [iobj/cat,P] from the 3-1 class would be blocked. Thus, only three pieces of 
information need to be listed in the entry for give: it inherits from TRANSITIVE, 
it inherits from 3-1, and it has the feature [iobj/cat,N]. Notice that if we had 
inserted the entry under any other set of classes, more information would have 



2 Both elements of these pairs are atomic symbols. Here and throughout this paper, the 
'/' has only mnemonic relevance. 



had to be listed in the entry. A central characteristic of a good insertion is that 
it minimizes the amount of information that needs to be listed in the entry. 

In this paper, we will focus on feature-based default inheritance hierarchies: 
feature-based in that attribute/value pairs are used to encode characteristics of 
a class and default in that all inheritance relationships are defeasible. We will 
also assume that multiple inheritance is allowed: classes can inherit from more 
that one superclass. However, the results described in this paper are relevant 
to any system that utilizes defaults and for which inconsistency between two 
elements that are inherited can be discovered efficiently. 

The rest of this paper is structured as follows. First, we give an informal 
characterization of the insertion problem and argue that this characterization 
captures the relevant aspects of the problem. Second, we formalize this charac- 
terization. The problem, so defined, is NP-complete || and, thus, it is unlikely 
that a computationally tractable algorithm for its solution will be found. Next, 
we describe an approximation algorithm for the problem and through complexity 
analysis and discussion of preliminary experimentation, we argue that this algo- 
rithm produces reasonable results in an acceptable amount of time and space. 
The algorithm has been implemented and has produced good results in experi- 
ments involving inserting words into lexical hierarchies. 

2 Informal Characterization of the Problem 

As mentioned above, a good insertion places an object under classes from which 
it can inherit most of its features. In addition, it should use as few classes as 
possible to achieve this goal. These two requirements are aspects of a basic prin- 
ciple of classification schemes: reduce redundancy. A corollary of this principle is 
that a good insertion should minimize the amount of information stored in the 
entry for the object being inserted and thereby maximize the use of the informa- 
tion contained in the inheritance relationships. Three types of information are 
stored in an entry: its superclass (es), the features that are not inherited from any 
superclass, and the features that are needed to block an incorrect inheritance. 
Thus, an optimal insertion has the smallest possible value for the following sum 
(which we will refer to as the cost of the insertion or solution). 

2. number of superclasses + number of object features not in these superclasses + 
the number of object features that must be listed to block incorrect inheritance 

At first glance, one might think that one easy way to find an optimal solution 
is to start at the roots of the hierarchies that make up a structured lexicon and 
simply walk down these hierarchies, pruning off the branches below a class that 
contains a feature that the object does not have (since all classes below this 
class will also have this feature). This approach will work for strict inheritance 
hierarchies. Strict hierarchies do not allow default inheritance — all the subclasses 
below a given class have to inherit all its features. The algorithm outlined above 
uses this characteristic to cut down the search space for superclasses. However, 
default hierarchies do not have this characteristic: if a class A inherits from a 
class B, the set of features associated with A might not be a superset of the set 
associated with B. Thus, an insertion algorithm for default hierarchies cannot 



ignore A as a possible superclass for an object simply because it has decided 
B is unsuitable; A might be an exception to B in just the right ways. Because 
of these considerations, for the purposes of insertion, we make the following 
claim: each class in a default hierarchy should be viewed as the set of features 
that can be inherited from this class. This claim is central to the approach 
to insertion taken in this paper. It amounts to 'compiling out' the inheritance 
relationships so that a hierarchy becomes a set of sets of features. Compiling out 
the inheritance relationships is a process of pushing features down the inheritance 
links to the classes below so that all the features of a class are explicitly listed 
in the data structure for the class. For example, compiling out the hierarchy in 
Fig. 1 would produce the set of sets listed in (||). The first set corresponds to 
the COMPLEMENTATION class, the second to INCOMPLETE, the third to 
COMPLETE, and the fourth to TRANSITIVE. 

3- {{} 

{[complete,-] [subj/cat,N] [subj/case,nom] [subj/complete,+]} 
{ [complete, +]} 

{[complete,-] [subj/cat,N] [subj/case,nom] [subj /complete, +] 
[dobj/cat,N] [dobj/case,acc] [dobj/complete,+]} .... } 

By compiling out a hierarchy, one loses inheritance relationships between 
classes. However, since a good insertion minimizes the space needed to store the 
properties of the object being inserted, with respect to insertion, a class is simply 
a chance to save space by storing one class name instead of a number of features. 
Thus, what features can be inherited from a class is the only characteristic of a 
class relevant to insertion. As a result, when dealing with the insertion problem 
in default hierarchies, one should think of the hierarchy as a set of sets, each 
of which corresponds to a possible superclass of the object being inserted. The 
insertion problem, then, amounts to picking an optimal subset of this set of sets; 
the definition of optimal remains the same. 



3 Formal Definition 



In this section, we will formalize the intuitive characterization of the previous 
section. The first task is to give a formal characterization of the features we 
have been discussing informally in the previous section. A feature is a pair of 
atomic symbols (e.g. [a,v]). The first element is taken from a set of attributes 
(ATTRIBUTES) and the second from a set of values (VALUES); these sets may 
be infinite. The set VALUES includes a symbol ? which intuitively specifies 
that the corresponding attribute is undefined or unknown for the object with 
the feature, more on this later. Two features clash (i.e., are inconsistent) if their 
attributes are the same but their values are different; this definition also holds for 
the value ?. Note that testing if two features clash can be done in constant time. 
We will use clash to denote a binary function on sets of features that produces 
the subset of features from the first set that are in conflict with a feature of the 
second (see below). 



4. A = {[al,vl][a2,v2][a3,v3][a4,v4]} 

B = {[al,v5][a2,v2][a3,v20][a7,v7][a9,vl2]} 
clash(A,B) = {[al,vl][a3,v3]} 

We assume that the sets of features are internally consistent, i.e., each set con- 
tains at most one feature per attribute. clash(C,'D) can be computed for the 
finite sets C and T> in 0(|C||Z>|) time by simply testing every pair formed from 
an element from C and an element of T> for a clash. 

Although we will work with this specific feature system, the algorithms and 
proofs that we present here, generalize to any system for representing what is 
inherited with the following property: whether two inheritable elements clash can 
be computed in time polynomial in the size of the elements. Features systems 
used in natural language processing often have this property (i.e. polynomial- 
time unification algorithm). 

The next task is to further specify the hierarchies to be used. As mentioned 
above, we are concerned with feature-based default multiple inheritance hierar- 
chies. In addition, we assume that the hierarchies are unambiguous: for any given 
attribute, a node only inherits one value for it. We will discuss the issue of am- 
biguity with respect to insertion below. We also assume, for expository reasons, 
that the feature sets inheritable from the classes are unique: that a given set of 
features cannot be inherited from more than one class in a hierarchy. Further, 
we are concerned with cautious insertion: only the known features of the object 
being inserted should be inherited. An adventurous insertion would allow an ob- 
ject class to inherit extra features from its superclasses. In order to deal with the 
problem of inheriting extra features from superclasses, we will use the value ? 
in features to block unwanted inheritance, we will call such features '?-features.' 
Only object classes are allowed to contain ?-features. Fig. 2 illustrates both forms 
of insertion and the use of the ? value. In adventurous insertion, Objectl would 
have the feature [a3,v3] whereas with cautious insertion, its inheritance would 
be blocked by [a3,?]. 

In the previous section, we defined the insertion problem so that solutions 
could be incomplete: not all the features of the object being inserted need be 
inherited. Of course, a solution pays a price for forcing a feature to be listed in the 
object class. The second term of the sum in (||) specifies the price paid. Because 
of the possible existence of clashes (the third term), an optimal solution may be 
incomplete. Such incomplete insertions complicate the reduction needed to prove 
NP-completeness and the statement of the insertion algorithm. To circumvent 
this difficulty, we assume that all hierarchies contain a singleton class for each 
non-?-feature of the object being inserted. (We will leave these sets out of the 
figures of this paper.) A singleton class is a class from which exactly one feature 
can be inherited and which has no superclasses and no subclasses other than 
object classes. Listing a singleton class as a superclass in an object class is 
equivalent to simply listing the feature of the singleton class. This assumption 
allows us to require complete insertions while still being able to represent the 
notion of incomplete insertion. 
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Fig. 2. Adventurous vs. cautious insertion 

An instance of the insertion problem (IN) is a pair made up of a set T and 
a set Af as defined below. Note that only a finite number of features is needed 
to specify T and that there is a one-to-one mapping between the sets in Af and 
the classes of the hierarchy. 

T contains the features of the object being inserted. T is complete in the sense that 
for all attributes in ATTRIBUTES there exists a feature in T that contains a value 
for that attribute. However, T only has a finite number of features that contain 
values other than ?. 

Af is a set of sets of features. AT is finite and each of its elements is finite. 

If we return to our discussion of the hierarchy in Fig.l, we see that the set 
of features of give, listed in (Q), plus the appropriate ?-features, is T for this 
hierarchy. The set of sets of features in (||) that resulted from compiling out the 
hierarchy, plus singleton sets for the features of J- : is Af. 

A solution is a set V C Af. V represents the superclasses for the object whose 
features are fF . Every non-?-feature in T must be an element of either a set in V 
or clash^, (J'P).El The features in this latter set have to be listed explicitly for 
the object in order to override incorrect inheritance. An optimal solution mini- 
mizes the number of superclasses together with the number of clashes between 
these superclasses and T: \V\ + \clash(!F, \JV)\. Finally, the insertion problem, 
IN, is the problem of finding optimal solutions for instances as defined above. 

4 An Approximation Algorithm for IN 

As mentioned above, IN is NP-complete. Thus, the best one should hope for 
is a computationally tractable algorithm that produces near optimal solutions. 

3 [J<P where $ is a set of sets ip\, (pn, <p n is equal to ipi U <p U ... U Lp„. 



Part of the NP-complctcncss proof is a relatively straightforward reduction of 
the NP-complete set covering (SC) problem. SC is defined as follows 

"...the set-covering problem consists of a finite set T and a family M of 
subsets of J 7 , such that every element of T belongs to at least one subset 
of M ... We say that a subset S 6 M covers its elements. The problem is to 
find a minimum-size subset V C M whose members cover all of J 7 ." (Q, 
p. 974, we have substituted variable names analogous to those in IN). 
IN can be viewed as extending SC in two ways. First, it substitutes features 
for integers. Features introduce the possibility of clashes between elements. This 
makes the process of choosing V more complex since one must take into ac- 
count features clashed with as well as features covered. Second, intuitively, IN 
loosens the restriction that all non-?-elements in T must be covered. However, 
our formalization technically requires complete coverage but retains the ability 
to represent incomplete covers since listing a singleton set in the object class is 
equivalent to listing an uncovered feature. 

A polynomial time approximation algorithm exists for SC and the solutions 
it produces are guaranteed to be close to the optimal solution. More formally, the 
ratio of the size of the approximate solution produced by the algorithm to the 
optimal solution is bounded by the natural logarithm of the size of the set being 
covered ||: \P a pprox\/\'P ptimai\ < ln\J- \ + 1. This approximation algorithm is 
greedy: at any given point, it picks the subset that can cover the most features 
at that time and it never goes back on this choice. Greedy algorithms tend to be 
fast but, since they cannot backtrack, they can make local choices that prevent 
globally optimal solutions. Because of the similarities between SC and IN, it 
seems likely that a greedy approximation algorithm will produce good solutions 
for IN. Just such an algorithm, Greedy-IN, is listed below. 

1. Ftemp ■= {[a,v]\[a,v] £ T Av ?} 

2. V := 

3. Tdash '■= 

4. While Ftemp 

5. select S £ N that maximizes \S n J r temp\ — \clash(T, S) — T c iash\ 

6. Ttemp := ftemp — (S U clash{T , S)) 

7. Fdash := Tdash U clash{T, S) 

8. P:=PU{S} 

9. return 

10. list V and T c i aa h in the data structure for the object 

Greedy-IN takes as input the set of features of the object being inserted, 
and a set of sets of features, TV, that represents the classes of the hierarchies. 
It produces as output, a list of superclasses and a list of features that must be 
listed locally for the object. During each iteration of its main loop (lines 4-9), 
it picks the most suitable superclass (line 5). The features from T that can be 
inherited from the new superclass S combined with those that must be listed to 
block incorrect inheritance are subtracted from T . This loop is repeated until 
there is no class in M from which more features can be inherited than must 
be listed to block incorrect inheritance. The algorithm returns the superclasses 
along with features that must be listed to block incorrect inheritance. To pick 
the most suitable superclass, a subroutine is called for each class remaining in 



AT that computes the difference between the number of features remaining in T 
that can be inherited from the class and the number of new features that would 
be incorrectly inheritedu 

5 Analysis of the Algorithm 

As expected, Greedy-IN runs in time polynomial in the size of the encoding 
of the input pair (T,Af), more specifically, 0(mm{|jF non _?|, |A/"|}|^ r ,ion-?||A/"|) 
where T non - r > represents the set of non-?-features in T . The space complexity 
is also polynomial in the size of the input: 0(max{\J 7 non -?\, \N\\S ma x\}) where 
Smax is the largest element of N '. 

Greedy-IN has been implemented and tested on a number of DATR || lexical 
hierarchies. Testing consisted of enumerating the features of a small number 
of words by hand, inserting the words into the hierarchies by hand, running 
Greedy-IN on the words, and then comparing the insertions. Greedy-IN was 
extremely fast and produced insertions that in most cases were very close to 
those done by hand. Although these results are encouraging, we can, at this 
point, offer neither extensive experimental results showing good performance 
nor a theoretical proof of a logarithmic ratio bound for Greedy-IN. Instead, we 
will discuss the basic situation that causes sub-optimal results to be produced 
by greedy algorithms. In addition, we will discuss how the algorithm performs 
with respect to two standard problems for default inheritance hierarchies taken 
from |Q: redundant links and nixon diamonds. A system that utilizes default 
inheritance hierarchies must address these problems since, if left unattended, 
they can produce inconsistencies and/or unexpected behavior in the system. 

An example of the basic sub-optimal solution producing situation is presented 
below. 

5. A: [al,vl][a2,v2][a3,v3][a4,v4] 

B: [al,v5] [a2,v2] [a3,v20] [a7,v7] [a9,vl2] 
C: [a3,v3][a4,v4][a6,v6] 

T = {[al,vl][a2,v2][a3,v3][a4,v4][a5,v5][a6,v6]} 

The problem here is that class A is seductive; it covers a large number of features 
from J 7 , more than B or C. However, if A is chosen it is still necessary to pick 
B and C or the two appropriate singleton sets to cover the features not covered 
by A: [a5,v5] and [a6,v6]. If an algorithm can withstand the temptation of class 
A and instead go with B or C, this algorithm is rewarded by simply choosing 
the other class (either B or C) to cover the rest of T . However, Greedy-IN falls 
for sets like A every time. One can tweak the selection criteria for the winning 
node (line 5) to produce an optimal solution for any given instance. However, in 
general, it is always possible to come up with an instance for which these new 
criteria produce a sub-optimal result. This is due to the fact that the central 

4 For some applications, features may not be equally important with respect to inser- 
tion. In such situations, features could be assigned different weights to account for the 
difference and in line 5 such weights would produce different payoffs and penalties. 



characteristic of the algorithm stays the same: decisions are made with only 
local information and no backtracking is performed. Note that such problematic 
instances do not seem to cause Greedy-IN to produce wildly incorrect solutions, 
simply slightly sub-optimal ones. 

Another situation that is problematic for Greedy-IN is illustrated in Fig. 3. 
The compiled-out set, A/", (minus the singleton sets) for this hierarchy is listed 
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[al,vl] [a2,v2] [a3,v3] 
[a4,v4] [a5,vS] [a6,v6] 
[a7,v7] 



B 

[»5,vl2] [a6,vl3] [a7,vl4] 
[a8,v8] [a9,v9] [alO.vlO] 
[all.vll] [al2,vl2] 

T — {[al.vl] [a2,v2] [a3,v3] [a4,v4] [aS,vS] 

[a6,v6] [a7,v7] [a8,v8] [a9,v9] [alO.vlO] 
[all.vll] [al2,vl2]} 



Fig. 3. Redundant link producing situation 



in (|6|) where the first set corresponds to A and the second to B. 

6. N ={{[al,vl] [a2,v2] [a3,v3] [a4,v4] [a5,v5] [a6,v6] [a7,v7]} 

{[al,vl] [a2,v2] [a3,v3] [a4,v4] [a5,vl2] [a6,vl3] [a7,vl4] 
[a8,v8] [a9,v9] [alO.vlO] [all,vll] [al2,vl2]}} 

During the first iteration of the main loop of Greedy-IN, A will be selected to 
be a superclass since it will have a score of 7 whereas B will only have a score of 
6. In the next round, however, B will be chosen since it will then have a score of 
2 which is higher than the scores obtained by the singleton sets. At this point all 
the features of T will be covered. Thus, the insertion for the object will list A and 
B as parents and the following features [a5,v5], [a6,v6], [a7,v7]. The link from the 
object class to A is known as a redundant link because there is already a path 
to A through B. However, notice that the features that end up on the object 
class are the correct ones; no inconsistencies exist. This is a result of the way 
clashes are handled. Notice also that the solution is sub-optimal. The optimal 
insertion would list the same three features but only inherit from B; the link to 
A could be removed without changing the features of the object class. Contrary 
to our earlier claim that each class in a default hierarchy should be viewed as 
the set of features that can be inherited from this class, it appears that Fig. 3 
illustrates a case where the inheritance links between classes are relevant for 
insertion. One approach to this problem is to use weights on features in the sets 
of M to encode the relevant hierarchical structure of the classes. These weights 
would be added during the compilation process; the weight of a feature would 
start at 1 in the class where it originates and would then increase slightly each 



time it is passed down to a lower class. In line 5 of the algorithm, the weights 
would cause lower classes would be favored over higher ones. Alternatively, a 
post-processing algorithm could be used to search for and eliminate redundant 
links. In any case, redundant links are a problem for Greedy-IN as it currently 
stands. 

The second standard problem for default inheritance hierarchies occurs when 
an object class inherits from two classes each of which has a different value 
for the same attribute. This problem is known as the nixon diamond and the 
standard example is that Richard Nixon was both a Republican and a Quaker. 
Republicans believe military force has to be used sometimes whereas Quakers do 
not. Thus, it is unclear what features the object class Nixon should have. The 
question here is whether Greedy-IN ever produces such situations. The answer is 
no. Because clashes are handled by listing locally the correct feature and because 
T is filled out by ?-features, it is not possible for a Nixon diamond to result from 
a Greedy-IN insertion: when inserting Nixon, regardless of which class is chosen 
first, [miluse,?] will be listed in the Nixon object class, thereby blocking any 
inheritance of a feature with the miluse attribute. 

6 Conclusion 

We started this paper by discussing informally the problem of deciding where 
an object belongs in a feature-based default inheritance hierarchy. This is an 
important problem but one that has, to our knowledge, received little attention 
in the literature. We formalized this problem as IN. A crucial aspect of this for- 
malization is that the hierarchy is viewed as a set of unrelated sets of features. 
Two facts combine to support this claim: i) a good insertion minimizes the space 
needed to store the object and thus, a class is seen as an opportunity to replace 
the listing of a number of features by a single superclass, ii) the structure of a 
default hierarchy cannot be used to reduce the search space of potential super- 
classes. Because IN is NP-complete, we designed an approximation algorithm 
for it. We showed that this algorithm is efficient and that it performs well with 
respect to a pair of standard problems for default inheritance. 

7 Acknowledgements 

The ideas about insertion discussed here grew out of the need for a way to 
move data from the ELWIS database to inheritance hierarchies. The INSYST 
project prototype mentioned above is a first attempt at implementing such a 
system. The INSYST project was a joint effort between Sabine Reinhard, Marie 
Boyle- Hinrichs, and myself. Marie Boyle-Hinrichs was responsible for the imple- 
mentation of the INSYST prototype. I would like to thank both of them for their 
efforts and their support of my own. 

Both ELWIS and INSYST were developed at the University of Tubingen 
under the supervision of Erhard Hinrichs. I would like to thank him for the 



opportunity to spend a summer in Tubingen working on these projects. I would 
like to thank Lenhart Schubert for reading many versions of this report and 
providing invaluable comments. I would also like to thank Chris Barker, George 
Ferguson, Dafydd Gibbon, Sabine Reinhard, and Mark Young for their comments 
on earlier versions of this report. Finally, I would like to thank Ycnjo Han, 
Leonidas Kontothanassis, Jeff Schneider, and Paul Dietz for their help with the 
initial design and analysis of Greedy-IN. 

References 

[1] T. Briscoe, V. de Paiva, and A. Copestake. Inheritance, Defaults, and the Lexicon. 

Cambridge University Press, 1993. 
[2] T. Cormen, C. Leiserson, and R. Rivest. Introduction to Algorithms. MIT Press, 

1990. 

[3] R. Evans and G. Gazdar. The DATR Papers. Technical report, U. of Sussex, Cog. 

Sci. Research Reports, 1990. 
[4] D. Flickinger. Lexical Rules in the Hierarchical Lexicon. PhD thesis, Stanford, 

1987. 

[5] M. Light. Classification in Feature-based Default Inheritance Hierarchies. Techni- 
cal Report 473, Department of Computer Science, University of Rochester, 1993. 

[6] L. Lovasz. On the ratio of optimal integral and fractional covers. Discrete Mathe- 
matics, 13:383-390, 1975. 

[7] D. S. Touretzky. The Mathematics of Inheritance Systems. Morgan Kaufmann, 
1986. 



This article was processed using the I^TgX macro package with LLNCS style 



